Abstract



The problem of separating structured information representing phenomena of differing 
natures is considered. A structure is assumed to be independent of the others if it can be
represented in a complementary subspace. When the concomitant subspaces are well 
separated the problem is readily solvable by a linear technique. Otherwise, the linear 
approach fails to correctly discriminate the required information. Hence, a non-extensive
approach is proposed. The resulting nonlinear technique is shown to be suitable for dealing 
with cases that cannot be tackled by the linear one. 

1 Introduction 

We consider the problem of discriminating information produced by phenomena of differing 
natures, via inverse methods. This involves the study of the physical state of a system by 
analyzing its response to some external interaction. We refer to the interactive carrier as input 
signal and to the system's reaction as signal response. Unfortunately, a particular response is 
not always directly available, as one may receive it 'disguised' by the interference with another 
independent phenomenon not being the focus of specific interest. In this paper we restrict 
our consideration to responses evoked by statistical systems. By this we understand systems 
which are fully characterized by a probability distribution indicating either the population
of subsystems comprising the whole system, or the degree of uncertainty about the system
being in one of its possible states. We regard both situations to be identical in the description
and refer to subsystems as the system's states.

In order to formulate the problem let us use the label $i$, ranging from 1 to $M$, to denote the
$i$-th state of a system which is characterized by a probability $p_i$. Adopting Dirac's notation we
indicate by a ket $|f_{\mathcal{V}}\rangle$ the system's response to some input signal and by $|v_i\rangle$ the
corresponding response of the $i$-th state. Consequently, the system's signal response satisfies

$$|f_{\mathcal{V}}\rangle \propto \sum_{i=1}^{M} p_i |v_i\rangle.$$

This relation is transformed into an equality by simply relaxing the condition $\sum_{i=1}^{M} p_i = 1$, so that

$$|f_{\mathcal{V}}\rangle = \sum_{i=1}^{M} c_i |v_i\rangle,$$

where the coefficients in the superposition are not necessarily normalized to unity. As already
stated, we are interested in the problem of discriminating $|f_{\mathcal{V}}\rangle$ from a given signal $|f\rangle$ of which
$|f_{\mathcal{V}}\rangle$ is a component. Among the many situations involving this problem, two that immediately
come to mind are the intensity of X-rays produced simultaneously by dispersion and diffraction, and
an infrared emission spectrum superimposed on blackbody radiation. In order to model all
relevant cases we assume that, rather than $|f_{\mathcal{V}}\rangle$, the available signal is
$|f\rangle = |f_{\mathcal{V}}\rangle + |f_{\mathcal{W}^\perp}\rangle$,
where $|f_{\mathcal{W}^\perp}\rangle$ is produced by an independent phenomenon. We focus on those cases ensuring
a unique decomposition, i.e., we further assume that the subspaces hosting the components
$|f_{\mathcal{V}}\rangle$ and $|f_{\mathcal{W}^\perp}\rangle$ are complementary. However, our interest is in complementary
subspaces that are close enough together to move the problem of separating the components far
away from the trivial one. Certainly, if the subspaces hosting the signal components are well
separated, the problem is readily solvable by means of an oblique projection onto one of the
subspaces and along the other [1,2]. Conversely, if the subspaces are not well separated the
construction of the necessary projector becomes ill posed and the problem needs to be tackled
in an alternative way. In this Communication we address the matter by including a hypothesis
upon the system producing the signal response. We assume that the population of states is
$K$-sparse in the sense that, out of the $M$ possible states of the system, only $K < M$ of them are
characterized by a significant probability. Nevertheless, the hypothesis generates, in general,
an intractable problem, because the populated states are unknown and the number
of possibilities of populating $K$ states out of $M$ is the combinatorial number $\binom{M}{K}$. This makes
the exhaustive search for the unknown states an impossible task for most values of $M$ and
$K$. In recent publications [3, 4] a greedy strategy for making the search tractable has been
proposed. In the present context, the proposal of those publications amounts to assuming a priori
that no state is populated and looking for the populated ones in a stepwise manner. Here we
investigate the possibility of addressing the problem from the opposite viewpoint. Assuming
a priori that all the states are equally populated, we will determine the actual population of
each state via the minimization of the $q$-norm-like quantity $\sum_{i=1}^{M} |c_i|^q$, $0 < q \le 1$. The
minimization of this quantity as an appropriate criterion for determining a sparse solution to
an under-determined linear system is discussed in [5,6]. For nonnegative coefficients
$c_i$, $i = 1, \dots, M$, normalized to unity, this quantity is closely related to the non-extensive entropic
measure broadly applied in physics [7-12] since Tsallis introduced it as the essential ingredient
of his thermodynamic analysis framework [7]. In the present context the value of $q$ plays a
particular role. By choosing $0 < q \le 1$ we introduce an assumption on the sought distribution.
We assume that not all the possible states in a system's model are significantly populated. This
assumption is meant to compensate for the actual overestimation of possibilities one usually
makes when a system's signal response is modelled mathematically.

The paper is organized as follows: Section 2 introduces the mathematical setting of the
problem and discusses the construction of oblique projectors. Section 3 highlights the need for
nonlinear approaches to separate signal components living in subspaces which are 'theoretically'
complementary, but close enough to prevent the discrimination of the components from being
realized by a linear operation. The proposed strategy, based on the minimization of the $q$-norm
raised to the power $q$, $\sum_{i=1}^{M} |c_i|^q$,
subject to recursively selected constraints, is discussed in Section 4 and illustrated in the same
section by a numerical simulation. The numerical experiment is especially designed to highlight
the robustness of the proposed approach against significant error in the data. The conclusions
are presented in Section 5.

2 Mathematical setting of the problem 

As already mentioned, adopting Dirac's notation we represent the response of a statistical
system to some external interaction as $|f_{\mathcal{V}}\rangle$, which is expressible in the form

$$|f_{\mathcal{V}}\rangle = \sum_{i=1}^{M} c_i |v_i\rangle. \tag{1}$$

Since the kets are elements of an inner product space, their square norm is induced by the inner
product, i.e., $\| |f_{\mathcal{V}}\rangle \|^2 = \langle f_{\mathcal{V}} | f_{\mathcal{V}} \rangle$.

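The problem we are concerned with entails 'rescuing' a ket response $|f_{\mathcal{V}}\rangle$ from an available
mixture $|f\rangle = |f_{\mathcal{V}}\rangle + |f_{\mathcal{W}^\perp}\rangle$, where $|f_{\mathcal{W}^\perp}\rangle$ is produced by an
independent phenomenon (e.g. a structured interference that one would call background, referring to a
persistent effect out of the focus of the main interest).

Denoting $\mathcal{V} = \mathrm{span}\{|v_i\rangle\}_{i=1}^{M}$ and assuming that the subspace $\mathcal{W}^\perp$ such that
$|f_{\mathcal{W}^\perp}\rangle \in \mathcal{W}^\perp$ is known, we restrict considerations to the case
$\mathcal{V} \cap \mathcal{W}^\perp = \{0\}$ so as to ensure the uniqueness
of the decomposition $|f\rangle = |f_{\mathcal{V}}\rangle + |f_{\mathcal{W}^\perp}\rangle$. Such a problem has a straightforward
'theoretical' solution. Certainly, from $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$, the oblique projector onto $\mathcal{V}$ along
$\mathcal{W}^\perp$, one immediately has

$$\hat{E}_{\mathcal{V}\mathcal{W}^\perp}|f\rangle = \hat{E}_{\mathcal{V}\mathcal{W}^\perp}\left(|f_{\mathcal{V}}\rangle + |f_{\mathcal{W}^\perp}\rangle\right) = |f_{\mathcal{V}}\rangle.$$

However, as will be discussed in the next section, when the subspaces $\mathcal{V}$ and $\mathcal{W}^\perp$ are not well
separated the numerical construction of $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ becomes ill posed, thus preventing the signal
separation from being correctly realized.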

2.1 Construction of Oblique Projections 

Let us recall that every idempotent operator is a projector. Hence, an operator $\hat{E}$ is a projector
if $\hat{E}^2 = \hat{E}$. The projection is along its null space and onto its range. When these subspaces are
orthogonal $\hat{E}$ is called an orthogonal projector, which is the case if and only if $\hat{E}$ is self-adjoint.
Otherwise it is called an oblique projector. For a good and amusing introduction to oblique
projectors in the context of signal processing we refer to [13], and for advanced theoretical study
of oblique projector operators in infinite dimensional spaces to [14,15]. Here we will restrict
ourselves to issues related to numerical constructions.

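Assuming that $\mathcal{V} \cap \mathcal{W}^\perp = \{0\}$, the oblique projector operator onto $\mathcal{V}$ along
$\mathcal{W}^\perp$ will be represented as $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$, as above. Then
$\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ satisfies $\hat{E}^2_{\mathcal{V}\mathcal{W}^\perp} = \hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ and, consequently,

$$\hat{E}_{\mathcal{V}\mathcal{W}^\perp}|g\rangle = |g\rangle \ \text{ if } |g\rangle \in \mathcal{V}, \qquad
\hat{E}_{\mathcal{V}\mathcal{W}^\perp}|g\rangle = 0 \ \text{ if } |g\rangle \in \mathcal{W}^\perp.$$

In the particular case for which $\mathcal{W}^\perp = \mathcal{V}^\perp$ the operator $\hat{E}_{\mathcal{V}\mathcal{V}^\perp}$ is an
orthogonal projection onto $\mathcal{V}$. For indicating an orthogonal projector onto a subspace, $\mathcal{X}$ say,
we use the particular notation $\hat{P}_{\mathcal{X}}$.

Given $\mathcal{V}$ and $\mathcal{W}^\perp$, in order to construct $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ we define $\mathcal{S}$ as the direct sum
of $\mathcal{V}$ and $\mathcal{W}^\perp$, which we express as

$$\mathcal{S} = \mathcal{V} \oplus \mathcal{W}^\perp.$$

Let $\mathcal{W} = (\mathcal{W}^\perp)^\perp$ be the orthogonal complement of $\mathcal{W}^\perp$ in $\mathcal{S}$. Thus we have
$\mathcal{S} = \mathcal{V} \oplus \mathcal{W}^\perp = \mathcal{W} \oplus^\perp \mathcal{W}^\perp$, where the operation $\oplus^\perp$ indicates
the orthogonal sum, referring to the direct sum of orthogonal subspaces. Assuming that a set
$\{|y_i\rangle\}_{i=1}^{J}$ spanning $\mathcal{W}^\perp$ is known, we can always
construct the orthogonal projector $\hat{P}_{\mathcal{W}^\perp}$ to be expressed in the form

$$\hat{P}_{\mathcal{W}^\perp} = \sum_{j=1}^{J'} |o_j\rangle\langle o_j|,$$

where vectors $\{|o_j\rangle\}_{j=1}^{J'}$ span $\mathcal{W}^\perp$ and are orthonormal, while the given set
$\{|y_i\rangle\}_{i=1}^{J}$ is not necessarily orthogonal, nor even linearly independent.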

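From the set $\{|v_i\rangle\}_{i=1}^{M}$ spanning $\mathcal{V}$, a spanning set for $\mathcal{W}$ is readily obtained as

$$|u_i\rangle = |v_i\rangle - \hat{P}_{\mathcal{W}^\perp}|v_i\rangle = \hat{P}_{\mathcal{W}}|v_i\rangle, \quad i = 1, \dots, M. \tag{2}$$

Denoting by $\{|i\rangle\}_{i=1}^{M}$ the standard orthonormal basis for $\mathbb{C}^M$, operators
$\hat{V}: \mathbb{C}^M \to \mathcal{V}$ and $\hat{U}: \mathbb{C}^M \to \mathcal{W}$ are defined as

$$\hat{V} = \sum_{i=1}^{M} |v_i\rangle\langle i|, \qquad \hat{U} = \sum_{i=1}^{M} |u_i\rangle\langle i|.$$

Consequently, the adjoint operators $\hat{U}^*$ and $\hat{V}^*$ are

$$\hat{U}^* = \sum_{i=1}^{M} |i\rangle\langle u_i|, \qquad \hat{V}^* = \sum_{i=1}^{M} |i\rangle\langle v_i|.$$

Since $\hat{P}_{\mathcal{W}}\hat{V} = \hat{U}$ and $\hat{U}^*\hat{P}_{\mathcal{W}} = \hat{U}^*$, the operator
$\hat{G}: \mathbb{C}^M \to \mathbb{C}^M$ given below

$$\hat{G} = \hat{U}^*\hat{V} = \hat{U}^*\hat{U}$$

is a self-adjoint operator. The elements of its matrix representation are

$$\langle i|\hat{G}|j\rangle = \langle u_i|v_j\rangle = \langle u_i|u_j\rangle, \quad i,j = 1, \dots, M.$$

In terms of the above defined operators the oblique projector $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ is expressed as

$$\hat{E}_{\mathcal{V}\mathcal{W}^\perp} = \hat{V}\hat{G}^\dagger\hat{U}^* \tag{3}$$

or, equivalently,

$$\hat{E}_{\mathcal{V}\mathcal{W}^\perp} = \sum_{i=1}^{M} |v_i\rangle\langle w_i|, \tag{4}$$

where

$$\langle w_i| = \sum_{j=1}^{M} \langle i|\hat{G}^\dagger|j\rangle\langle u_j| = \sum_{j=1}^{M} g^\dagger_{i,j}\langle u_j|, \tag{5}$$

with $g^\dagger_{i,j} = \langle i|\hat{G}^\dagger|j\rangle$ the element $(i,j)$ of a matrix $G^\dagger$ indicating the pseudo inverse of $G$. It
is actually straightforward to verify that $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ given in (4) satisfies the required properties.
Namely, i) $\hat{E}^2_{\mathcal{V}\mathcal{W}^\perp} = \hat{E}_{\mathcal{V}\mathcal{W}^\perp}$, ii) $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}|f_{\mathcal{V}}\rangle = |f_{\mathcal{V}}\rangle$ for all
$|f_{\mathcal{V}}\rangle \in \mathcal{V}$, and iii) $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}|f_{\mathcal{W}^\perp}\rangle = 0$ for all
$|f_{\mathcal{W}^\perp}\rangle \in \mathcal{W}^\perp$.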

Note: The condition $\mathcal{V} \cap \mathcal{W}^\perp = \{0\}$ implies that the dimension of $\mathcal{V}$ is equal to the
dimension of $\mathcal{W}$. Accordingly, if the spanning set $\{|v_i\rangle\}_{i=1}^{M}$ is linearly independent, operator
$\hat{G}$ has an inverse. Nevertheless, the independence of $\{|v_i\rangle\}_{i=1}^{M}$ is not required, so that an inverse
for $\hat{G}$ need not exist. For the sake of generality we use $\hat{G}^\dagger$, which is equal to $\hat{G}^{-1}$ when such
an inverse does exist.

Let us stress that, since operators $\hat{V}$ and $\hat{U}$ are given in terms of spanning sets for the spaces
$\mathcal{V}$ and $\mathcal{W}$, respectively, any such spanning sets can be used. This possibility yields a number
of different ways of computing $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$; all of them, of course, theoretically equivalent but not
necessarily numerically equivalent when the problem is ill posed.
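
As an illustration of the construction in Eqs. (2) and (3), the following is a minimal NumPy sketch for finite-dimensional real or complex vectors. The function name `oblique_projector`, the rank tolerance, and the small test vectors are assumptions made for this example only; they are not taken from the paper.

```python
import numpy as np

def oblique_projector(V, Y):
    """Matrix of the oblique projector onto span(V) along span(Y).

    V : (n, M) array whose columns span the signal subspace.
    Y : (n, J) array whose columns span the background subspace.
    """
    # Orthonormal basis of span(Y) from the SVD (Y need not be full rank).
    O, s, _ = np.linalg.svd(Y, full_matrices=False)
    O = O[:, s > 1e-12 * s.max()]
    # u_i = v_i - P_{W_perp} v_i, as in Eq. (2).
    U = V - O @ (O.conj().T @ V)
    # G = U^* V = U^* U; the pseudo-inverse covers dependent spanning sets.
    G = U.conj().T @ U
    return V @ np.linalg.pinv(G) @ U.conj().T   # Eq. (3)

# Hypothetical example: V spans the x-y plane of R^3, background along (1, 1, 1).
V = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
Y = np.array([[1.0], [1.0], [1.0]])
E = oblique_projector(V, Y)
```

In this toy case the subspaces are well separated, so the projector is idempotent, leaves the columns of `V` unchanged and annihilates `Y` to machine precision.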

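Considering that $|\psi_n\rangle \in \mathbb{C}^M$, $n = 1, \dots, M$, are the eigenvectors of $\hat{G}$ and assuming that
there exist $N$ nonzero eigenvalues $\lambda_n$, $n = 1, \dots, N$, on taking these eigenvalues in descending
order we can express the matrix elements of the Moore-Penrose pseudo inverse of $\hat{G}$ as:

$$g^\dagger_{i,j} = \sum_{n=1}^{N} \langle i|\psi_n\rangle \frac{1}{\lambda_n} \langle\psi_n|j\rangle. \tag{6}$$

Moreover, the orthonormal vectors

$$|\xi_n\rangle = \frac{\hat{U}|\psi_n\rangle}{\sigma_n}, \quad \sigma_n = \sqrt{\lambda_n}, \quad n = 1, \dots, N \tag{7}$$

are singular vectors of $\hat{U}^*$, which satisfy $\hat{U}^*|\xi_n\rangle = \sigma_n|\psi_n\rangle$, as is immediate to verify. By
defining now the vectors $|\eta_n\rangle$, $n = 1, \dots, N$, as

$$|\eta_n\rangle = \frac{\hat{V}|\psi_n\rangle}{\sigma_n}, \quad n = 1, \dots, N, \tag{8}$$

the projector $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ in (4) is recast as

$$\hat{E}_{\mathcal{V}\mathcal{W}^\perp} = \sum_{n=1}^{N} |\eta_n\rangle\langle\xi_n|. \tag{9}$$

Proposition 1. The vectors $|\xi_n\rangle \in \mathcal{W}$, $n = 1, \dots, N$, and $|\eta_n\rangle \in \mathcal{V}$, $n = 1, \dots, N$, given in (7)
and (8) are biorthogonal to each other and span $\mathcal{W}$ and $\mathcal{V}$, respectively.

The proof of the above proposition is given in Appendix A.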
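
The statement of Proposition 1 can be checked numerically on a small case. The sketch below builds the vectors of Eqs. (7) and (8) from the eigen-decomposition of $G$ for a hypothetical real-valued example (the vectors and the eigenvalue tolerance are assumptions of this illustration) and assembles the projector as in Eq. (9).

```python
import numpy as np

# Hypothetical small example: V spans the x-y plane of R^3 and the
# background direction is (1, 1, 1).
V = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
y = np.ones(3) / np.sqrt(3.0)
U = V - np.outer(y, y @ V)            # u_i = P_W v_i, Eq. (2)

lam, psi = np.linalg.eigh(U.T @ U)    # eigenpairs of G (ascending order)
keep = lam > 1e-12 * lam.max()        # keep the nonzero eigenvalues only
lam, psi = lam[keep][::-1], psi[:, keep][:, ::-1]   # descending order
sigma = np.sqrt(lam)

xi = (U @ psi) / sigma                # Eq. (7): columns are the vectors xi_n
eta = (V @ psi) / sigma               # Eq. (8): columns are the vectors eta_n

biorth = xi.T @ eta                   # biorthogonality: should be the identity
E = eta @ xi.T                        # Eq. (9)
```

Here `biorth` and `xi.T @ xi` are both identity matrices up to rounding, and `E` coincides with the projector obtained from Eq. (3) for the same subspaces.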

It is immediate to verify that the representation (4) of $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ also arises from (9), since

$$|w_i\rangle = \sum_{n=1}^{N} |\xi_n\rangle \frac{1}{\sigma_n} \langle\psi_n|i\rangle, \quad i = 1, \dots, M. \tag{10}$$

Moreover, the representation (4) can be numerically realized in different ways by using different
spanning sets to compute the operator $\hat{G}$. Indeed, by orthogonalizing $\{|u_i\rangle\}_{i=1}^{M}$ to obtain
the orthogonal set $\{|q_i\rangle\}_{i=1}^{M'}$, $M' \le M$, also spanning $\mathcal{W}$, the matrix elements of operator $\hat{G}_q$
(replacing $\hat{G}$) are given as $\langle q_i|v_j\rangle$ (or equivalently as $\langle q_i|u_j\rangle$), $i, j = 1, \dots, M$. Thus, vectors
$\langle w_i|$ in (4) are calculated as $\langle w_i| = \sum_{j=1}^{M'} \langle i|\hat{G}_q^\dagger|j\rangle\langle q_j|$.

If spaces $\mathcal{V}$ and $\mathcal{W}^\perp$ are not too close, which is reflected by the fact that the nonzero
singular values of $\hat{U}^*$ are not too small, all the constructions of $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ are equivalent. However,
as will be discussed in the next section, the existence of small singular values may render all the
possible numerical constructions of $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}$ incapable of producing the expected signal splitting
by the operation $\hat{E}_{\mathcal{V}\mathcal{W}^\perp}|f\rangle$.
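
The effect of closeness between the subspaces can be made concrete in two dimensions. The sketch below is an illustration under our own assumptions (two lines in $\mathbb{R}^2$ separated by an angle `theta`, and the hypothetical helper `projector_norm`): it builds the projector as $V G^\dagger U^*$ and returns its spectral norm, which for this geometry equals $1/\sin\theta$. Any error in the data along the background direction is amplified accordingly.

```python
import numpy as np

def projector_norm(theta):
    """Spectral norm of the oblique projector onto span{(1, 0)} along
    span{(cos theta, sin theta)} in R^2, built as V G^+ U^*."""
    V = np.array([[1.0], [0.0]])
    y = np.array([np.cos(theta), np.sin(theta)])
    U = V - np.outer(y, y @ V)          # component of v orthogonal to y
    G = U.T @ U                         # 1 x 1 here; equals sin(theta)**2
    E = V @ np.linalg.pinv(G) @ U.T
    return np.linalg.norm(E, 2)

# Orthogonal lines, moderately close lines, and nearly parallel lines.
norms = {theta: projector_norm(theta) for theta in (np.pi / 2, 0.1, 1e-3)}
```

The single singular value of $U^*$ in this example is $\sin\theta$, so a small singular value and a large projector norm are two views of the same ill-conditioning.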

3 The need for non-linear approaches 

This section is dedicated to illustrating, by recourse to a numerical example, the crucial role 
that nonlinear approaches could play for the success of discriminating signal components when 
the concomitant linear problem is ill posed. 

Numerical Example. Let $\mathcal{V}$ be the cardinal cubic spline space with distance 0.01 between
consecutive knots, on the interval $[0, 1]$. This is a subspace of dimension $M = 103$, which we
span using a B-spline basis

$$\mathcal{B} = \{B_i(x),\ x \in [0, 1]\}_{i=1}^{103}.$$

The functions $B_i(x)$ in $\mathcal{B}$ are obtained by translations of a prototype function and the restriction
to the interval $[0, 1]$ [16,17]. A few of such functions are plotted in the left hand graph of
Figure 1. Here the inner product is defined as $\langle f|h\rangle = \int_0^1 f^*(x)\,h(x)\,dx$, and all the integrals
are computed numerically.
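
A basis of this kind can be sketched as follows. The code assumes the standard centred cardinal cubic B-spline as the prototype function, with translates centred at $-0.01, 0, 0.01, \dots, 1.01$ (103 functions of support 0.04, consistent with the description of the spectral lines in Section 4.2). The function names and the sampling grid are choices made for this illustration only.

```python
import numpy as np

def cubic_bspline(t):
    """Centred cardinal cubic B-spline with unit knot spacing (support [-2, 2])."""
    t = np.abs(np.asarray(t, dtype=float))
    inner = 2.0 / 3.0 - t**2 + 0.5 * t**3        # branch for |t| < 1
    outer = (2.0 - t) ** 3 / 6.0                 # branch for 1 <= |t| < 2
    return np.where(t < 1.0, inner, np.where(t < 2.0, outer, 0.0))

def bspline_basis(x, delta=0.01):
    """Columns are the translates centred at -delta, 0, ..., 1 + delta,
    evaluated at the sample points x in [0, 1]."""
    centres = delta * np.arange(-1, round(1.0 / delta) + 2)
    return cubic_bspline((x[:, None] - centres[None, :]) / delta)

x = np.linspace(0.0, 1.0, 2001)
B = bspline_basis(x)          # shape (2001, 103)
```

On $[0, 1]$ the columns of `B` are nonnegative and sum to one at every sample point, which is a quick sanity check that no translate overlapping the interval has been left out.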



Figure 1: Left graph: cubic B-spline functions, in the range $x \in [0.1, 0.3]$, from the set spanning the
space of the signal response. Right graph: three of the functions spanning the space of the background.

Randomly taking 30 B-splines $\{B_{\ell_i}\}_{i=1}^{30}$ from $\mathcal{B}$ we simulate a spectrum by a weighted
superposition of such functions, i.e., the response signal is modelled as

$$\langle x|f_{\mathcal{V}}\rangle = f_{\mathcal{V}}(x) = \sum_{i=1}^{30} c_{\ell_i} B_{\ell_i}(x), \quad x \in [0, 1], \tag{11}$$

with the coefficients $c_{\ell_i}$ randomly chosen from $[0, 1]$.

We simulate a background by considering that it belongs to the subspace W"*" spanned by 
the set of functions 

Y = {y,{x) = ix + O.OlJr°•°'^ X e [0, 1]}%,. 

A few functions from this set are plotted in the right hand graph of Figure 1 (normalized to
unity on $[0, 1]$). The background, $g(x)$, is generated by the linear combination

50 

{x\g)=g{x) = J2fe-'-'''yA^)- (12) 
i=i 

To simulate the data we have perturbed the superposition of (11) and (12) by 'very small'
Gaussian errors (of variance up to 0.00001% the value of each data point) and plotted the
simulated data in the left hand graph of Figure 2.

This example illustrates well how sensitive to errors the oblique projection is. The subspaces
we are dealing with are disjoint: the last five singular values of operator $\hat{U}^*$ (cf. (7)) are:

0.3277, 0.3276, 1.0488 x 10-^ 6.9356 x 10"^ 2.3367 x 10"^°, 

while the first is $\sigma_1 = 1.4493$. The smallest singular value cannot be considered a numerical
representation of zero when the calculations are being carried out in double precision arithmetic.
Hence, one can assert that the condition $\mathcal{V} \cap \mathcal{W}^\perp = \{0\}$ is fulfilled. However, due to the three
small singular values the oblique projector along $\mathcal{W}^\perp$ onto the whole subspace $\mathcal{V}$ is very unstable,
and it fails to correctly separate the signals in $\mathcal{V}$ from the background. The result of applying
the oblique projector to the signal of the left hand graph is represented by the broken line
in the right hand graph. As can be observed, the projection does not yield the required signal,
which is represented by the continuous dark line in the same graph. Now, since the spectrum
of singular values has a clear jump (the last three singular values are far from the previous
ones) it might seem that one could regularize the projection by truncation of singular values.
Nevertheless, such a methodology turns out to be inappropriate for the present problem, as it
does not yield the correct separation.

Proposition 2 below analyzes the effect that regularization by truncation of singular values
has on the resulting projection.

Proposition 2. Truncation of the expansion (9) to consider up to $r$ terms produces an oblique
projector along $\mathcal{W}_r = \mathcal{W}^\perp + \mathcal{W}_0 + \mathcal{V}_0$, with $\mathcal{V}_0 = \mathrm{span}\{|\eta_i\rangle\}_{i=r+1}^{N}$ and
$\mathcal{W}_0 = \mathrm{span}\{|\xi_i\rangle\}_{i=r+1}^{N}$, onto $\mathcal{V}_r = \mathrm{span}\{|\eta_i\rangle\}_{i=1}^{r}$.

The proof of this proposition is to be found in Appendix B.

The above example illustrates very clearly the need for nonlinear approaches. We know that 
a unique and stable solution does exist, since the signal which is to be discriminated from the 
background actually belongs to a subspace of the given spline space, and the construction of the 
oblique projectors onto such a subspace is well posed. However, the lack of knowledge about 
the subspace prevents us from separating the signal components by a linear operation. The 
greedy approaches that have been proposed for making tractable the search for the unknown 
subspace proceed in a stepwise manner [3,4]. Within those approaches, all the effort is focussed 
on the search for the right subspace for recursively constructing and adapting the vectors $|w_i\rangle$
(cf. (4)). Conceptually, the proposal in [4] amounts to assuming a priori that none of the system
states is populated and using the available signal to determine which are the populated ones.

Figure 2: Left graph: signal plus background. Right graph: the dark continuous line corresponds
to the signal to be discriminated from the one in the left graph. The broken line corresponds to the
approximation resulting from the oblique projection. The three close light lines correspond to the
approximations obtained by truncation of one, two, and three singular values.

Here we wish to investigate the outcomes yielded by the converse prior assumption, i.e., by
considering a priori that all the states are equally populated and using the available signal to
learn which are the non-populated ones.

4 The proposed nonlinear approach 

We start by recalling the available strategy for transforming the problem of discriminating
the system's signal response $|f_{\mathcal{V}}\rangle$ from a given signal $|f\rangle = |f_{\mathcal{V}}\rangle + |f_{\mathcal{W}^\perp}\rangle$ into the problem
of constructing the sparse representation of $|f_{\mathcal{V}}\rangle$ in $\mathcal{V}$. Let us stress, once again, that i) the
problem we need to face arises from the ill-posed feature of the oblique projectors onto the
whole subspace $\mathcal{V}$ and ii) we work under the hypothesis that there exists an unknown subspace
$\mathcal{V}_K = \mathrm{span}\{|v_{\ell_i}\rangle\}_{i=1}^{K} \subset \mathcal{V}$, where $\{\ell_i\}_{i=1}^{K}$ is a set of $K$ unknown indexes such that

$$|f_{\mathcal{V}}\rangle = |f_{\mathcal{V}_K}\rangle = \sum_{i=1}^{K} c_{\ell_i} |v_{\ell_i}\rangle. \tag{13}$$

Equivalently, (13) can be expressed in the form

$$|f_{\mathcal{V}}\rangle = |f_{\mathcal{V}_K}\rangle = \sum_{i=1}^{M} c_i |v_i\rangle, \quad c_i = 0 \ \text{ if } i \neq \ell_j,\ j = 1, \dots, K. \tag{14}$$

Hence, to find the subspace $\mathcal{V}_K$ is equivalent to finding the sparse representation of $|f_{\mathcal{V}}\rangle$ in $\mathcal{V}$,
i.e. a representation given by (14) where only $K$ coefficients are nonzero. However, we need to
allow for the fact that we do not have access to the signal $|f_{\mathcal{V}}\rangle$ but only to the signal $|f\rangle$. As
proposed in [4], we can deal with this lack of information by noticing that, by applying
the projector $\hat{P}_{\mathcal{W}}$ to both sides of (14), we have

$$|f_{\mathcal{W}}\rangle = |f_{\mathcal{W}_K}\rangle = \sum_{i=1}^{M} c_i |u_i\rangle, \tag{15}$$

where $|f_{\mathcal{W}_K}\rangle = \hat{P}_{\mathcal{W}}|f_{\mathcal{V}_K}\rangle$ and $|f_{\mathcal{W}}\rangle = \hat{P}_{\mathcal{W}}|f\rangle$. Denoting by $\hat{I}_{\mathcal{S}}$ the identity operator in
$\mathcal{S}$, the projector $\hat{P}_{\mathcal{W}}$ is obtained as $\hat{P}_{\mathcal{W}} = \hat{I}_{\mathcal{S}} - \hat{P}_{\mathcal{W}^\perp}$. Thus, since the subspaces $\mathcal{S}$ and
$\mathcal{W}^\perp$ are known, we do have access to the component $|f_{\mathcal{W}}\rangle$. Because the coefficients in (14) and (15)
are identical, one can find the sparse representation (13) by finding the sparse representation (15) and using
the resulting coefficients in (14).
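
For completeness, the step leading from the available signal to the expansion in the vectors $|u_i\rangle$ can be written out explicitly. It uses only the definitions given in Section 2, namely $\hat{P}_{\mathcal{W}}|v_i\rangle = |u_i\rangle$ and the fact that $\hat{P}_{\mathcal{W}}$ annihilates every vector of $\mathcal{W}^\perp$:

```latex
\hat{P}_{\mathcal{W}}|f\rangle
  = \hat{P}_{\mathcal{W}}|f_{\mathcal{V}}\rangle + \hat{P}_{\mathcal{W}}|f_{\mathcal{W}^\perp}\rangle
  = \sum_{i=1}^{M} c_i\, \hat{P}_{\mathcal{W}}|v_i\rangle + 0
  = \sum_{i=1}^{M} c_i\, |u_i\rangle .
```

The background component is therefore removed exactly, while the coefficients $c_i$ are left untouched. What may be lost is the numerical independence of the vectors $|u_i\rangle$, which is where the sparsity hypothesis comes into play.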

At this point we begin to differ from the proposal in [4]. While in that publication the
problem is tackled by a greedy stepwise search for the indices $\ell_i$, $i = 1, \dots, K$, in (13), here
we take a different route and strive to find the sparse solution of (15) by minimization of the
$q$-norm-like quantity

$$\| |c\rangle \|_q^q = \sum_{i=1}^{M} |c_i|^q, \quad \text{with } 0 < q \le 1. \tag{16}$$

The minimization of the $q$-norm raised to the power $q$ for determining a sparse solution has been studied in
Mathematics and Signal Processing and justified by the following consideration. The problem of
finding the sparsest representation of a given model is equivalent to minimization of the zero
norm $\| |c\rangle \|_0$ (or counting measure), which is defined as:

$$\| |c\rangle \|_0 = \sum_{i=1}^{M} |c_i|^0$$

and therefore is equal to the number of nonzero entries of $|c\rangle$. The minimization of $\| |c\rangle \|_0$
subject to linear constraints is a classical problem of combinatorial search, which is in general
NP-hard [18]. Thus, the minimization of $\sum_{i=1}^{M} |c_i|^q$, for $0 < q \le 1$, has been considered [6].
However, since the minimization of $\sum_{i=1}^{M} |c_i|^q$, $0 < q < 1$, does not lead to a convex optimization
problem, the most popular norm to minimize, when a sparse solution is required, is the 1-norm
$\sum_{i=1}^{M} |c_i|$. Minimization of the 1-norm is considered the best convex approximant to
the minimizer of $\| |c\rangle \|_0$ [19,20]. Moreover, it can be efficiently solved by linear programming
techniques [19]. Since the problem of signal separation we are considering admits a unique
solution, we are not particularly concerned about convexity. Hence we will set up our numerical
strategy letting the parameter $q$ take any value in $(0, 1]$.
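
A small numerical comparison may help to see why this quantity favours sparse coefficient vectors. In the sketch below (the function name `q_measure` and the two test vectors are assumptions of this illustration) both vectors have the same 1-norm, yet for $q < 1$ the vector with a single populated state gives the smaller value, and as $q$ approaches zero the quantity approaches the number of nonzero entries.

```python
import numpy as np

def q_measure(c, q):
    """Sum of |c_i|**q over the nonzero entries of c (0 < q <= 1)."""
    a = np.abs(np.asarray(c, dtype=float))
    return float(np.sum(a[a > 0] ** q))

sparse = np.array([1.0, 0.0, 0.0, 0.0])      # one populated state
dense = np.array([0.25, 0.25, 0.25, 0.25])   # four equally populated states

# Print the measure of both vectors for decreasing q.
for q in (1.0, 0.8, 0.5, 0.01):
    print(q, q_measure(sparse, q), q_measure(dense, q))
```

Restricting the sum to nonzero entries only avoids evaluating `0 ** q` for very small `q`; it does not change the value for any $q > 0$.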



4.1 Managing the constraints 

The optimization process we consider is stated as follows: Given the constraints (15), minimize (16).

Now, in general, in order to make use of constraints (15) we need a numerical representation of
$|f_{\mathcal{W}}\rangle$, which in practice is obtained by experimental measures. Thus, while restricting
considerations to linear measurements we represent them as linear functionals, which, as established
by Riesz' theorem [21], are amenable to representation by inner products with some vectors.
Accordingly, we express measures on $|f\rangle$ by the inner products

$$m_j = \langle m_j|f\rangle, \quad j = 1, \dots, N.$$

The specification of the measurement vectors $|m_j\rangle$, $j = 1, \dots, N$, should be given in each
particular case. The ones considered here have been chosen in relation to the examples we are
presenting. Firstly, to simulate the observed data we suppose that the measures are performed
by varying some parameter (e.g. time, wavelength, temperature) that is denoted as the
variable $x$, discretized at the points $x_j$, $j = 1, \dots, N$, to obtain the measures

$$f_{\mathcal{W}}(x_j) = \langle x_j|f_{\mathcal{W}}\rangle, \quad j = 1, \dots, N$$

and the corresponding linear functionals

$$\langle x_j|u_i\rangle = \langle x_j|\hat{P}_{\mathcal{W}}|v_i\rangle, \quad i = 1, \dots, M, \quad j = 1, \dots, N$$

from the state's signal responses $|v_i\rangle$, $i = 1, \dots, M$. While the functionals $\langle x_j|u_i\rangle$ are modeled
according to physical considerations, the values $f_{\mathcal{W}}(x_j)$ are experimental data, and are thereby affected
by errors. We then use the notation $f^o_{\mathcal{W}}(x_j)$, $j = 1, \dots, N$, to indicate the observations of
$f_{\mathcal{W}}(x_j)$, $j = 1, \dots, N$. Consequently, rather than reproducing the data $f^o_{\mathcal{W}}(x_j)$, $j = 1, \dots, N$,
we request that the model given by the r.h.s. of (15) satisfies the restriction

$$\sum_{j=1}^{N} \left(f^o_{\mathcal{W}}(x_j) - f_{\mathcal{W}}(x_j)\right)^2 \le \delta, \tag{17}$$

$\delta$ accounting for the data's error. The stated optimization process subjected to this constraint
is numerically difficult to realize. Nevertheless, we show here that the available information can
be handled so as to successfully achieve the discrimination of signal components, even when
the data errors are significant. For this we make use of an idea we had introduced much earlier,
in [22], and applied in [23]: Replacing $f_{\mathcal{W}}(x_j)$ by (15), the condition of minimal square distance
$\sum_{j=1}^{N} \left(f^o_{\mathcal{W}}(x_j) - f_{\mathcal{W}}(x_j)\right)^2$ leads to the so called normal equations:

$$\langle u_n|f^o_{\mathcal{W}}\rangle = \sum_{i=1}^{M} c_i \langle u_n|u_i\rangle, \quad n = 1, \dots, M. \tag{18}$$

Of course, since we are concerned with ill posed problems we cannot use all these equations to
find the coefficients $c_i$, $i = 1, \dots, M$. However, as proposed in [22], we could use 'some' of these
equations as constraints of our optimization process, the number of such equations being that
necessary to reach the condition (17).
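
The passage from the least squares condition to the normal equations can be spelled out as follows. The derivation assumes real-valued functions and reads the inner product as the sum over the sample points, $\langle g|h\rangle = \sum_{j=1}^{N} g(x_j)\,h(x_j)$, which is a simplification made for this illustration. Setting to zero the derivative of the square distance with respect to each coefficient $c_n$ gives

```latex
\frac{\partial}{\partial c_n} \sum_{j=1}^{N}
  \Big( f^o_{\mathcal{W}}(x_j) - \sum_{i=1}^{M} c_i\, u_i(x_j) \Big)^2
  = -2 \sum_{j=1}^{N} u_n(x_j)
    \Big( f^o_{\mathcal{W}}(x_j) - \sum_{i=1}^{M} c_i\, u_i(x_j) \Big) = 0
\quad\Longrightarrow\quad
\langle u_n | f^o_{\mathcal{W}} \rangle = \sum_{i=1}^{M} c_i\, \langle u_n | u_i \rangle ,
\qquad n = 1, \dots, M .
```

Each value of $n$ yields one linear equation in the coefficients, so selecting a subset of these equations amounts to imposing a subset of linear constraints on the minimization.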

We have then transformed the original problem into the one of minimizing (16) subject to
a number of equations selected from (18), the $\ell_n$-th ones, $n = 1, \dots, r$, say. We leave aside for the
moment the restrictions $c_i \ge 0$, $i = 1, \dots, M$. We should worry about them only if they were
not satisfied.

In line with [22] we select the subset of equations (18) in an iterative fashion. We start with
the initial estimation $c_i = C$, $i = 1, \dots, M$, where the constant $C$ is determined by minimizing
the distance between the model and the data. Thus,

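$$C = \frac{\sum_{n=1}^{M} \langle u_n|f^o_{\mathcal{W}}\rangle}{\sum_{n=1}^{M}\sum_{i=1}^{M} \langle u_n|u_i\rangle}. \tag{19}$$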

With this initial estimation we 'predict' the normal equations (18) and select as our first
constraint the one worst predicted by the initial solution; let this equation be the $\ell_1$-th one. We
then minimize (16) subject to the constraint

$$\langle u_{\ell_1}|f^o_{\mathcal{W}}\rangle = \sum_{i=1}^{M} c_i \langle u_{\ell_1}|u_i\rangle, \tag{20}$$

and indicate the resulting coefficients as $c^{(1)}_i$, $i = 1, \dots, M$. With these coefficients we predict
equations (18) and select the worst predicted as a new constraint to obtain $c^{(2)}_i$, $i = 1, \dots, M$,
and so on. The iterative process is stopped when the condition (17) is reached.
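
The recursive selection of constraints can be sketched as below for real-valued sampled data. This is only an outline under stated assumptions: the names `select_constraints` and `min_norm_solver`, the matrix `Umat` holding the samples $u_i(x_j)$, and the toy data are all hypothetical, and the default solver returns the minimum 2-norm solution as a simple stand-in for the minimization of (16), which the paper carries out with the method of [6].

```python
import numpy as np

def min_norm_solver(A, b):
    """Placeholder solver: minimum 2-norm solution of A c = b.
    The paper minimizes sum |c_i|**q instead (method of ref. [6])."""
    return np.linalg.pinv(A) @ b

def select_constraints(Umat, f_obs, delta, solve=min_norm_solver):
    """Umat[j, i] = u_i(x_j); f_obs[j] = observed value of f_W at x_j."""
    G = Umat.T @ Umat                       # matrix of <u_n|u_i>
    rhs = Umat.T @ f_obs                    # vector of <u_n|f_obs>
    M = G.shape[0]
    c = np.full(M, rhs.sum() / G.sum())     # constant initial estimate
    free = np.ones(M, dtype=bool)           # equations not yet selected
    selected = []
    # Stop when the data misfit reaches delta, cf. condition (17).
    while np.sum((f_obs - Umat @ c) ** 2) > delta and free.any():
        misfit = np.abs(rhs - G @ c)        # prediction error of each equation
        n = int(np.argmax(np.where(free, misfit, -np.inf)))
        free[n] = False
        selected.append(n)
        c = solve(G[selected, :], rhs[selected])
    return c, selected

# Hypothetical toy data: three orthonormal 'responses', one populated state.
Umat = np.eye(3)
f_obs = np.array([3.0, 0.0, 0.0])
c, selected = select_constraints(Umat, f_obs, delta=1e-12)
```

In this toy case the worst predicted equation is the first one, and imposing it alone already reproduces the data, so the loop stops after a single constraint. Any solver with the same call signature can be passed through the `solve` argument.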

The reader may be aware that the proposed strategy involves highly nonlinear equations
and in many situations the number of necessary constraints is large enough to generate a
troublesome numerical task. However, we have been able to solve the simulation of the next section
(comprising up to 57 constraints) by recourse to the method for minimization of the $q$-norm raised
to the power $q$ published in [6]. Such an iterative method, called FOCal Underdetermined System Solver
(FOCUSS) in that publication, is straightforward to implement. It evolves by computation
of pseudoinverse matrices, which under the given hypothesis of our problem, and within our
recursive strategy for feeding the constraints, are guaranteed to be numerically stable (for a
detailed explanation of the method see [6]).
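
To give a flavour of this kind of iteration, the following is a minimal sketch of a FOCUSS-type reweighted minimum-norm scheme. It is written from the general idea described above and is not the implementation of [6]: the weighting exponent $1 - q/2$, the fixed number of iterations, the function name `focuss` and the toy system are assumptions of this illustration. In line with the prior assumption adopted here, the iteration starts from all coefficients being equal.

```python
import numpy as np

def focuss(A, b, q=0.8, n_iter=50):
    """Basic FOCUSS-type iteration for min sum |c_i|**q subject to A c = b."""
    c = np.ones(A.shape[1])                  # all states equally populated
    for _ in range(n_iter):
        w = np.abs(c) ** (1.0 - q / 2.0)     # reweighting from the current iterate
        c = w * (np.linalg.pinv(A * w) @ b)  # weighted minimum-norm solution
    return c

# Hypothetical underdetermined system: 2 constraints, 3 unknowns.
# Its solutions are (1 - t, 1 - t, t); the sparsest one is (0, 0, 1).
A = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])
b = np.array([1.0, 1.0])
c = focuss(A, b)
```

The first pass returns the minimum 2-norm solution $(1/3, 1/3, 2/3)$; the subsequent reweighting steps progressively suppress the two small entries, and for this toy system the iterates approach the sparsest solution $(0, 0, 1)$ while satisfying the constraints at every step.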

4.2 Numerical Simulation 

We test the proposed approach, first on the simulation of Example 1 of Section 3, and then
extend that simulation to consider a more realistic level of uncertainty in the data. Let us
remark that the signal is meant to represent an emission spectrum consisting of the
superposition of spectral lines (modeled by B-spline functions of support 0.04) which are centered at
the positions $(n - 1)\Delta$, $n = 0, \dots, 102$, with $\Delta = 0.01$. Since the errors in the data in Example
1 are not significant, the procedure outlined in the previous section accurately recovers the
spectrum from the background, with any positive value of the $q$-parameter less than or equal
to one. The result (coinciding with the theoretical one) is shown in the right hand top graph
of Figure 3.

Now we transform the example into a more realistic situation by adding larger errors to
the data. In this case, the data set is perturbed by Gaussian errors of variance up to 1% of
each data point. Such a piece of data is plotted in the left middle graph of Figure 3 and the
spectrum extracted by the proposed approach is represented by the broken line in the right
middle graph of Figure 3, which is difficult to differentiate from the theoretical one (continuous
line).

Finally we increase the data's error up to 3% of each data point (left bottom graph of
Figure 3) and, in spite of the perceived significant distortion of the signal, we could still recover
a spectrum which, as shown by the broken line in the right bottom graph of Figure 3, is a fairly
good approximation of the true one (continuous line). We have repeated the experiment for
different realizations of the errors (with the same variance) and the results remained essentially
equivalent. Moreover, we have considered other realizations of the experiment by drawing
different spectra through the process described in Section 3. By observing the outcomes of a
number of different realizations we can assert that the quality of the results shown in Figure 3
is a fair representation of those obtained for different spectra. Variation of the $q$-value did not
produce significant changes. The results of Figure 3 were obtained for the value $q = 0.8$. The
numbers of equations that it was necessary to use in order to reach the stopping criterion for the
different levels of error were: $K = 57$ for the data in the top graph, $K = 51$ for the data in the
middle graph and $K = 43$ for the bottom graph.

It is appropriate to stress once more that for a small level of errors the solution of this example
is unique. The numerical experiment illustrates the fact that, for the degree of sparsity being
considered (out of 103 states only 30 are populated), the solution can be reached by the whole
range of $q$ values in $(0, 1]$. This is so because for all $q$ in the range $(0, 1]$ the number of constraints
which are needed to obtain the solution is still small enough to yield a well posed problem. Let
us stress further that we have not made explicit use of the constraints $c_i \ge 0$, $i = 1, \dots, M$,
but look for the solution by minimization of the quantity $\sum_{i=1}^{M} |c_i|^q$, which is non-extensive for
all $q$-values.

Figure 3: Top left graph: signal plus background generated as described in Example 1 of Section
3. Top right graph: recovered system's signal response, which coincides with the true one. Middle
left graph: signal of Example 1 distorted by Gaussian errors of variance up to 1% of each data value.
Middle right graph: the broken line represents the approximation of the system's signal response
yielded by the proposed approach. The continuous line represents the true signal. Bottom graphs:
same description as in the previous graphs but with the data distorted by Gaussian errors of variance up
to 3% of each data value.
5 Conclusions

The problem of discriminating information produced by phenomena of different nature has been
addressed through a non-extensive nonlinear approach. The proposed framework is founded
on the minimization of a $q$-norm-like quantity. It is appropriate to remark that our main
concern was to realize the discrimination of information components in cases admitting a unique
theoretical solution. The problem was transformed into an underdetermined linear one, due
to the numerical instability of the concomitant full rank problem. The resulting approach has
been tested by recourse to a numerical example which cannot be handled by linear techniques
(even for unrealistically high quality data). A detailed analysis of the limitation affecting the
linear technique has been provided. The nonlinear approach presented here was shown to be
capable of overcoming those limitations. It has correctly realized the required task, even for
data distorted by significant random errors. We are aware that further studies may be in order
and we are confident that the results presented here will motivate future works.
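
A. Proof of Proposition 1

Proof. Using (7) and (8) we have

$$\langle\xi_m|\eta_n\rangle = \frac{1}{\sigma_m\sigma_n}\langle\psi_m|\hat{U}^*\hat{V}|\psi_n\rangle = \delta_{n,m}\frac{\lambda_n}{\sigma_n^2} = \delta_{n,m},$$

which proves the biorthogonality property.

The proof that $\mathrm{span}\{|\xi_n\rangle\}_{n=1}^{N} = \mathcal{W}$ stems from the fact that
$\mathcal{W} = \mathrm{span}\{|u_i\rangle\}_{i=1}^{M} = \mathrm{span}\{|w_i\rangle\}_{i=1}^{M}$,
which allows us to express an arbitrary $|g\rangle \in \mathcal{W}$ as the linear combination $|g\rangle = \sum_{i=1}^{M} a_i|w_i\rangle$.
Then, using (10), we have $|g\rangle = \sum_{n=1}^{N} \tilde{a}_n|\xi_n\rangle$ with
$\tilde{a}_n = \frac{1}{\sigma_n}\sum_{i=1}^{M} a_i\langle\psi_n|i\rangle$, which proves that
$\mathcal{W} \subset \mathrm{span}\{|\xi_n\rangle\}_{n=1}^{N}$. On the other hand, for $|g\rangle \in \mathrm{span}\{|\xi_n\rangle\}_{n=1}^{N}$ we can write
$|g\rangle = \sum_{n=1}^{N} b_n|\xi_n\rangle$ and using (7) we have $|g\rangle = \sum_{i=1}^{M} d_i|u_i\rangle$, with
$d_i = \sum_{n=1}^{N} \frac{b_n}{\sigma_n}\langle i|\psi_n\rangle$. This proves that
$\mathrm{span}\{|\xi_n\rangle\}_{n=1}^{N} \subset \mathcal{W}$ and therefore $\mathrm{span}\{|\xi_n\rangle\}_{n=1}^{N} = \mathcal{W}$. The proof that
$\mathrm{span}\{|\eta_n\rangle\}_{n=1}^{N} = \mathcal{V}$ is equivalent to the previous one. □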



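B. Proof of Proposition 2

Proof. The biorthogonality between $\{|\xi_i\rangle\}_{i=1}^{r}$ and $\{|\eta_i\rangle\}_{i=1}^{r}$ established in Proposition 1 ensures
that $\hat{E}_{\mathcal{V}_r\mathcal{W}_r} = \sum_{i=1}^{r} |\eta_i\rangle\langle\xi_i|$ is a projector, since
$\hat{E}^2_{\mathcal{V}_r\mathcal{W}_r} = \hat{E}_{\mathcal{V}_r\mathcal{W}_r}$.

As established in Proposition 1, $\mathcal{V} = \mathrm{span}\{|\eta_i\rangle\}_{i=1}^{N}$, and therefore every $|f\rangle \in \mathcal{V}$ can be
decomposed as $|f\rangle = |f_r\rangle + |f_0\rangle$ with $|f_r\rangle \in \mathrm{span}\{|\eta_i\rangle\}_{i=1}^{r}$ and
$|f_0\rangle \in \mathrm{span}\{|\eta_i\rangle\}_{i=r+1}^{N}$. Moreover,
$\hat{E}_{\mathcal{V}_r\mathcal{W}_r}|f\rangle = |f_r\rangle$, $\hat{E}_{\mathcal{V}_r\mathcal{W}_r}|f_r\rangle = |f_r\rangle$, and
$\hat{E}_{\mathcal{V}_r\mathcal{W}_r}|f_0\rangle = 0$, which proves that the projection is
onto $\mathcal{V}_r$ and $\mathcal{V}_0$ is included in the null space of $\hat{E}_{\mathcal{V}_r\mathcal{W}_r}$. Equivalently, for every
$|g_0\rangle \in \mathcal{W}_0 = \mathrm{span}\{|\xi_i\rangle\}_{i=r+1}^{N}$ we have $\hat{E}_{\mathcal{V}_r\mathcal{W}_r}|g_0\rangle = 0$, because the set
$\{|\xi_i\rangle\}_{i=1}^{N}$ is orthonormal. Thus, $\mathcal{W}_0$ is
included in the null space of $\hat{E}_{\mathcal{V}_r\mathcal{W}_r}$. □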